AITopics | game theory

Collaborating Authors

game theory

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Learning to Bid in Repeated Second-Price Auctions with Dynamic Values and Aggregated Feedback

Heymann, Benjamin, Sakhi, Otmane

arXiv.org Machine LearningMay-28-2026

We study the problem of learning to bid when the bidder's value is dynamic, i.e., when the current value depends on past outcomes. Specifically, we consider a bidder participating in repeated second-price auctions whose value depends on the time elapsed since their last successful bid, with auctions arriving in continuous time and only aggregated feedback revealed at the end of the horizon. Such a bidder must (1) balance the immediate benefit of winning the current auction against its impact on future values and (2) learn unknown environmental parameters. We derive regret bounds for a class of learning methods that combine plug-in estimators with a differential-equation characterization of the optimal policy, and show that a specific confidence bound algorithm learns the optimal policy with a near optimal regret of $\widetilde{O}(\log N)$ for piecewise linear primitives, and $\widetilde{O}(N^{1/3})$ for general, smooth primitives, achieving these regrets without explicit randomization. These theoretical results are supported by numerical experiments.

artificial intelligence, auction, machine learning, (19 more...)

arXiv.org Machine Learning

2605.28133

Country: Europe > United Kingdom (0.28)

Genre: Research Report (0.64)

Industry: Marketing (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

Support-aware offline policy selection for advertising marketplaces

Shekhar, Prashant, Howard, Caroline

arXiv.org Machine LearningMay-22-2026

Logged advertising auctions make offline reserve-price evaluation attractive but risky. Replay tables can identify policies with large apparent yield gains, yet they can also hide weak threshold support, multiple-comparison effects, subgroup harm, and bidder-response uncertainty. Existing replay and off-policy evaluation methods estimate or rank policy values, but they do not directly answer the operational question of whether the available evidence is strong enough to justify validation. This paper develops a support-aware offline decision framework for reserve-policy selection. Rather than outputting a single point-estimate winner, the framework converts logged evidence into a conservative decision object consisting of certified policies, statistically dominated alternatives, and unresolved candidates requiring further validation. The main theoretical result gives a unified finite-catalog guarantee showing that, under simultaneous uncertainty control and conservative support gates, the framework preserves the best gate-passing policy while eliminating only policies with certified regret. Supporting results characterize support-localized replay generalization, establish information-theoretic threshold-resolution limits, and quantify when heterogeneous bidder response can overturn localized replay rankings. Experiments on iPinYou real-time-bidding logs show that the leading reserve rule achieves a 47.66% replay lift in season two, a 40.71% simultaneous lower-bound lift, and a 43.87% frozen out-of-time replay lift in season three. The framework reduces a 19-policy catalog to a two-policy validation shortlist while certifying non-harm across 44 advertiser, exchange, and region segments. The results support the central claim that offline reserve-policy evaluation should produce certified validation decisions rather than point-estimate rankings alone.

artificial intelligence, machine learning, replay lift, (19 more...)

arXiv.org Machine Learning

2605.21736

Country: North America > United States > Florida > Volusia County > Daytona Beach (0.40)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Game Theory (0.67)

Add feedback

Do Not Trust The Auctioneer: Learning to Bid in Feedback-Manipulated Auctions

Foscari, Luigi, Tullii, Matilde, Perchet, Vianney

arXiv.org Machine LearningMay-22-2026

Shilling is the use of artificial bids to make competition appear stronger and push prices upward. We study repeated first-price auctions in which shilling affects feedback but not allocation: the learner wins or loses against the real competing bid, but after a loss observes the maximum of the real bid and an independent shill bid. Thus the manipulation changes what the learner observes and hence how it learns to bid, without changing the outcome of the current auction. We analyze regret with respect to the best bid benchmark, assuming that the shill-bid distribution is known. Even then, shilling can mask the real bid, while useful side information appears only through intermittent low-shill events. Our algorithm combines a robust interval-elimination branch, which ignores the shilled report and achieves the dynamic-pricing rate $\tilde{\mathcal{O}}(T^{2/3})$, with an optimistic branch that debiases losing-side reports and exploits the resulting suffix information when it is reliable and achieves the first-price auctions rate $\tilde{\mathcal{O}}(\sqrt{T})$. A validation and racing procedure lets the algorithm use these optimistic updates without knowing the right scale or feedback geometry in advance. We complement the upper bounds with a matching lower bound, up to logarithmic factors, in the single-active-region case. Overall, the results show that even feedback-only shilling can sharply alter the statistical difficulty of repeated bidding.

artificial intelligence, information, machine learning, (20 more...)

arXiv.org Machine Learning

2605.22438

Genre: Research Report > New Finding (0.48)

Industry: Information Technology (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Game Theory (0.69)

Add feedback

Prediction-Intervention Games and Invariant Sets

Kühne, Linus, Schur, Felix, Peters, Jonas

arXiv.org Machine LearningMay-19-2026

We consider the following two-player game: using observational data, the leader chooses a prediction function for a response variable $Y$ from given covariates. The follower then reacts with an intervention on some covariates in the underlying structural causal model to maximize their own objective. The leader knows the intervention targets, but may have limited knowledge of the follower's objective. We call this setup a prediction-intervention game, a special case of a Stackelberg game. Finding an optimal strategy for the leader is generally difficult. To avoid severe performance loss, the leader may base their prediction on the causal parents of $Y$, or more generally on an invariant subset of covariates. We prove, for two common classes of follower objectives, that predictors based on the stable blanket, a specific invariant subset, are always better or as good as those based on the causal parents. We further upper bound the leader's post-intervention risk by a worst-case risk over allowed interventions and strengthen existing distribution generalization results to analyze this bound: we give sufficient conditions under which stable-blanket predictors are worst-case optimal, and show by examples that these conditions cannot in general be dropped. Finally, we discuss practical strategies for settings with known and unknown graph, and test them on simulated and real-world data.

artificial intelligence, follower, machine learning, (18 more...)

arXiv.org Machine Learning

2605.16828

Country: Europe > Switzerland (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Game Theory (0.74)

Add feedback

Why are World Cup tickets so expensive?

Al JazeeraMay-14-2026, 07:42:53 GMT

Why are World Cup tickets so expensive? Game Theory Why are World Cup tickets so expensive? The 2026 World Cup is not only the biggest World Cup in history. With dynamic pricing and rising travel costs, the game may be global, but access isn't to your average football fan. So who gets to be in the stands?

artificial intelligence, game theory, live navigation menu news show, (6 more...)

Al Jazeera

Country:

Asia (1.00)
North America > United States (0.43)
North America > Central America (0.43)
North America > Canada (0.43)

Industry: Leisure & Entertainment > Games > Computer Games (0.40)

Technology:

Information Technology > Game Theory (0.63)
Information Technology > Artificial Intelligence > Games (0.40)

Add feedback

Attributions All the Way Down? The Metagame of Interpretability

Baniecki, Hubert, Biecek, Przemyslaw, Fumagalli, Fabian

arXiv.org Machine LearningMay-8-2026

We introduce the metagame, a conceptual framework for quantifying second-order interaction effects of model explanations. For any first-order attribution $ϕ(f)$ explaining a model $f$, we measure the directional influence of feature $j$ on the attribution of feature $i$, denoted as meta-attribution $φ_{j \to i}(f)$, by treating the attribution method itself as a cooperative game and computing its Shapley value. Theoretically, we prove that attributions hierarchically decompose into meta-attributions, and establish these as directional extensions of existing interaction indices. Empirically, we demonstrate that the metagame delivers insights across diverse interpretability applications: (i) quantifying token interactions in instruction-tuned language models, (ii) explaining cross-modal similarity in vision-language encoders, and (iii) interpreting text-to-image concepts in multimodal diffusion transformers.

large language model, machine learning, natural language, (21 more...)

arXiv.org Machine Learning

2605.06295

Genre: Research Report (1.00)

Industry:

Health & Medicine (1.00)
Leisure & Entertainment (0.93)
Government (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
(2 more...)

Add feedback

The World Cup & Passport privilege

Al JazeeraMay-3-2026, 08:04:51 GMT

Game Theory: Who gets to go to the 2026 World Cup? Who actually gets to go to the World Cup? With US President Donald Trump's strict immigration policies, some fans may never make it past the American border. Because while teams qualify on merit, passports don't. Al Jazeera's Samantha Johnson explains. The Masters: Golf's segregated past Are Iran's athletes political pawns?

artificial intelligence, game theory, live navigation menu news show, (6 more...)

Al Jazeera

Country:

North America > United States (1.00)
Asia > Middle East > Iran (0.32)

Industry:

Leisure & Entertainment > Games (1.00)
Government > Immigration & Customs (0.95)
Government > Regional Government > North America Government > United States Government (0.78)

Technology:

Information Technology > Game Theory (0.65)
Information Technology > Artificial Intelligence > Games (0.40)

Add feedback

fcd3909db30887ce1da519c4468db668-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 09:54:54 GMT

artificial intelligence, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (1.00)
Europe (0.67)

Genre: Research Report (0.68)

Industry:

Banking & Finance (0.67)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Game Theory (0.68)
Information Technology > Data Science > Data Mining (0.67)

Add feedback

fcd3909db30887ce1da519c4468db668-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 09:54:50 GMT

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.92)
Europe (0.67)

Genre: Research Report (0.68)

Industry:

Banking & Finance (0.67)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Game Theory (0.68)
Information Technology > Data Science > Data Mining (0.67)

Add feedback

Strategic stability under regularized learning in games

Neural Information Processing SystemsApr-30-2026, 08:39:11 GMT

In this paper, we examine the long-run behavior of regularized, no-regret learning in1 finite games. A well-known result in the field states that the empirical frequencies2 of no-regret play converge to the game's set of coarse correlated equilibria; however,3 our understanding of how the players' actual strategies evolve over time is much4 more limited - and, in many cases, non-existent. This issue is exacerbated by5 a series of recent results showing that only strict Nash equilibria are stable and6 attracting under regularized learning, thus making the relation between learning7 and pointwise solution concepts particularly elusive. In lieu of this, we take a more8 general approach and instead seek to characterize the setwise rationality properties9 of the players' day-to-day play. To that end, we focus on one of the most stringent10 criteria of setwise strategic stability, namely that any unilateral deviation from the11 set in question incurs a cost to the deviator - a property known as closedness under12 better replies (club).

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Europe (0.28)

Genre:

Workflow (0.68)
Research Report > New Finding (0.34)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Mathematics of Computing (0.64)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback